Deep Web Annotation Using Goal-Oriented Special Purpose Ontologies (Position Paper)
نویسندگان
چکیده
Current efforts in the semantic web area have not yet led to applications that deliver additional real-life value for web users. The main reason is a vicious circle of low populated concepts, low value for web users as well as missing incentives for annotating data. As a solution a pragmatic approach based on goal-oriented special purpose ontologies derived from the deep web in combination with annotation of instances in the deep web is presented. Finally, a use case is underpinning the idea. Introduction and Motivation Evolving semantic web technologies as well as first semantic web applications, bring forward the vision of a more powerful semantic web. However, real-life value for web users is still lacking. General purpose ontologies such as FOAF, SIOC and others do not yet generate a real addon in comparison to typical web 2.0 pages. Also semantic knowledge bases such as OpenCyc, DBpedia or OpenCalais currently don’t exceed the value of nonsemantic knowledge bases such as Wikipedia. Two reasons account for these facts: First, many concepts but only few instances are semantically described. Imagine a food retail consumer having the goal of creating his weekly shopping list. Concepts such as food item or shopping center don’t really help unless instances of food items and shopping centers are attached to those concepts. Right now, there are no incentives for data owners to annotate their data, since web users don’t use semantic applications. This will last as long as no value is generated by semantic web applications and no value is generated unless instance data is annotated – ending in a vicious circle. The second reason for limited value of current semantic web applications is the lack of special purpose ontologies and is related to the first reason. In order to break the Copyright © 2009, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. vicious circle described above, it is necessary to build semantic web applications that generate value, i.e. enable web users to achieve certain goals with a better performance than using traditional web applications. General purpose ontologies promote a top-down approach for building such applications with the promise of providing a solution for a broad range of goals within different domains. However, the effort for generating a critical mass of annotated instances increases with this approach, ending in web applications with low chances of generating value for a web user. Goal-Oriented Deep Web Annotation In this position paper a pragmatic research framework is presente, for generating limited scope but high value semantic web applications. Instead of using a top-down approach, a bottom up approach is suggested by annotating existing instances to ontologies that are build based on existing data structures on the web. It is estimated that in 2002 the web of fixed web pages (surface web) consisted of 167 TB of data whereas the database driven websites that create web pages on demand (deep web) comprised 91,850 TB of data (Lyman & Varian 2003). By retrieving the underlying concepts as well as the underlying data structures of the deep web, special purpose ontologies can be derived. Based on these ontologies, corresponding instances of the deep web can be retrieved and annotated, leading to highly populated concepts. Highly populated concepts lead to a high TermRank (Ding et al. 2005) making it easy to find the concept (e.g. with semantic search engines such as Swoogle) and increasing reuse of the concept. A similar approach has been proposed and studied earlier (Handschuh et al. 2003a, Handschuh et al. 2003b, Volz et al. 2004). However, in addition to these studies it is suggested that in order to create semantic web applications with high real-life value, the deep web should be annotated in a specific enough, human-centered domain. The NAICS
منابع مشابه
بررسی هستان شناسی های توسعه یافته مبتنی بر اصول هستان شناسی های منبع باز زیست پزشکی
Background and Aim: Ontologies facilitate data integration, exchange, searching and querying. Open Biomedical Ontologies (OBO) Foundry is a solution for creating reference ontologies. In this foundry, the design of ontologies is based on established principles which allow for their interactions as a single system. The purpose of this study is to determine the main features of ontologies develop...
متن کاملCollaborative Ontology Development with Inheritance Support
Modern development of huge ontologies in domain of crucial importance is inconceivable without appropriate means for effective collaboration between knowledge stakeholders. The paper presents an approach for collaborative ontology development using inheritance of classes and their properties, relations and metadata. Collaborative actions for maintenance of ontology content are supported by a sp...
متن کاملA Tool for Working with Web Ontologies
The task of building an open and scalable ontology browsing and editing tool based on OWL, the first standardized Web-oriented ontology language, requires the rethinking of critical User Interface and ontological engineering issues. In this paper, we describe Swoop, a browser and editor specifically tailored to OWL ontologies. Taking a ”Web view” of things has proven quite instructive and we di...
متن کاملStudy of Design Issues on an Automated Semantic Annotation System
The semantic annotation process turns ordinary HTML web pages into machine-understandable semantic web pages. We have proposed a semantic annotation system to automate annotation of web pages by ontologies. In this paper, we present a study of five design issues on this system, which include: (1) the covering of web pages; (2) the general paradigm of annotation process; (3) the measurements of ...
متن کاملA Framework for Managing Multiple Ontologies: The Function-Oriented Perspective
Ontologies are now ubiquitous in Semantic Web and knowledge representation areas. Managing multiple ontologies is a challenging issue including comparing existing ontologies, reusing ontologies, maintaining different versions, and so on. However, most previous multiple ontologies management work focused on ontologies maintenance, evolutions, and versioning. They ignored the very important point...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009